153 research outputs found

    HoloDetect: Few-Shot Learning for Error Detection

    Full text link
    We introduce a few-shot learning framework for error detection. We show that data augmentation (a form of weak supervision) is key to training high-quality, ML-based error detection models that require minimal human involvement. Our framework consists of two parts: (1) an expressive model to learn rich representations that capture the inherent syntactic and semantic heterogeneity of errors; and (2) a data augmentation model that, given a small seed of clean records, uses dataset-specific transformations to automatically generate additional training data. Our key insight is to learn data augmentation policies from the noisy input dataset in a weakly supervised manner. We show that our framework detects errors with an average precision of ~94% and an average recall of ~93% across a diverse array of datasets that exhibit different types and amounts of errors. We compare our approach to a comprehensive collection of error detection methods, ranging from traditional rule-based methods to ensemble-based and active learning approaches. We show that data augmentation yields an average improvement of 20 F1 points while it requires access to 3x fewer labeled examples compared to other ML approaches.Comment: 18 pages

    Towards Emotion Recognition: A Persistent Entropy Application

    Full text link
    Emotion recognition and classification is a very active area of research. In this paper, we present a first approach to emotion classification using persistent entropy and support vector machines. A topology-based model is applied to obtain a single real number from each raw signal. These data are used as input of a support vector machine to classify signals into 8 different emotions (calm, happy, sad, angry, fearful, disgust and surprised)

    Moderate exercise may attenuate some aspects of immunosenescence

    Get PDF
    BACKGROUND: Immunosenescence is related to the deterioration of many immune functions, which may be manifested in increased susceptibility to infection, cancer, and autoimmunity. Lifestyle factors, such as diet or physical activity, may influence the senescence of the immune system. It is widely accepted that moderate physical activity may cause beneficial effects for physical and psychological health as well as for the immune system activity in aged people. METHODS: Thirty elderly women aged 62 to 86 were subjected to a two-years authorized physical activity program. Peripheral blood lymphocytes distribution and the production of cytokines involved in the immune response development and regulation (IL-2, IL-4 and IFN-γ) were investigated. The same parameters were evaluated in two control groups of women: a sedentary group of 12 elderly women selected for the second round of the physical activity program and in a group of 20 sedentary young women. Flow cytometry methods were used for the examination of surface markers on peripheral blood lymphocytes and intracellular cytokines expression. RESULTS: The distribution of the main lymphocytes subpopulations in the peripheral blood of elderly women did not show changes after long-term moderate physical training. The percentage of lymphocytes expressing intracellular IL-2 was higher in the group of women attending 2-years physical activity program than in the control group of elderly sedentary women, and it was similar to the value estimated in the group of young sedentary women. There was no difference in the intracellular expression of IL-4 and IFN-γ between the active and elderly sedentary women. CONCLUSIONS: Our results suggest that moderate, long-term physical activity in elderly women may increase the production of IL-2, an important regulator of the immune response. This may help ameliorate immunosenescence in these women

    Belief Propagation and Loop Series on Planar Graphs

    Full text link
    We discuss a generic model of Bayesian inference with binary variables defined on edges of a planar graph. The Loop Calculus approach of [1, 2] is used to evaluate the resulting series expansion for the partition function. We show that, for planar graphs, truncating the series at single-connected loops reduces, via a map reminiscent of the Fisher transformation [3], to evaluating the partition function of the dimer matching model on an auxiliary planar graph. Thus, the truncated series can be easily re-summed, using the Pfaffian formula of Kasteleyn [4]. This allows to identify a big class of computationally tractable planar models reducible to a dimer model via the Belief Propagation (gauge) transformation. The Pfaffian representation can also be extended to the full Loop Series, in which case the expansion becomes a sum of Pfaffian contributions, each associated with dimer matchings on an extension to a subgraph of the original graph. Algorithmic consequences of the Pfaffian representation, as well as relations to quantum and non-planar models, are discussed.Comment: Accepted for publication in Journal of Statistical Mechanics: theory and experimen

    Second Order Dimensionality Reduction Using Minimum and Maximum Mutual Information Models

    Get PDF
    Conventional methods used to characterize multidimensional neural feature selectivity, such as spike-triggered covariance (STC) or maximally informative dimensions (MID), are limited to Gaussian stimuli or are only able to identify a small number of features due to the curse of dimensionality. To overcome these issues, we propose two new dimensionality reduction methods that use minimum and maximum information models. These methods are information theoretic extensions of STC that can be used with non-Gaussian stimulus distributions to find relevant linear subspaces of arbitrary dimensionality. We compare these new methods to the conventional methods in two ways: with biologically-inspired simulated neurons responding to natural images and with recordings from macaque retinal and thalamic cells responding to naturalistic time-varying stimuli. With non-Gaussian stimuli, the minimum and maximum information methods significantly outperform STC in all cases, whereas MID performs best in the regime of low dimensional feature spaces

    Dynamical Principles of Emotion-Cognition Interaction: Mathematical Images of Mental Disorders

    Get PDF
    The key contribution of this work is to introduce a mathematical framework to understand self-organized dynamics in the brain that can explain certain aspects of itinerant behavior. Specifically, we introduce a model based upon the coupling of generalized Lotka-Volterra systems. This coupling is based upon competition for common resources. The system can be regarded as a normal or canonical form for any distributed system that shows self-organized dynamics that entail winnerless competition. Crucially, we will show that some of the fundamental instabilities that arise in these coupled systems are remarkably similar to endogenous activity seen in the brain (using EEG and fMRI). Furthermore, by changing a small subset of the system's parameters we can produce bifurcations and metastable sequential dynamics changing, which bear a remarkable similarity to pathological brain states seen in psychiatry. In what follows, we will consider the coupling of two macroscopic modes of brain activity, which, in a purely descriptive fashion, we will label as cognitive and emotional modes. Our aim is to examine the dynamical structures that emerge when coupling these two modes and relate them tentatively to brain activity in normal and non-normal states

    A gene encoding an abscisic acid biosynthetic enzyme (LsNCED4) collocates with the high temperature germination locus Htg6.1 in lettuce (Lactuca sp.)

    Get PDF
    Thermoinhibition, or failure of seeds to germinate when imbibed at warm temperatures, can be a significant problem in lettuce (Lactuca sativa L.) production. The reliability of stand establishment would be improved by increasing the ability of lettuce seeds to germinate at high temperatures. Genes encoding germination- or dormancy-related proteins were mapped in a recombinant inbred line population derived from a cross between L. sativa cv. Salinas and L. serriola accession UC96US23. This revealed several candidate genes that are located in the genomic regions containing quantitative trait loci (QTLs) associated with temperature and light requirements for germination. In particular, LsNCED4, a temperature-regulated gene in the biosynthetic pathway for abscisic acid (ABA), a germination inhibitor, mapped to the center of a previously detected QTL for high temperature germination (Htg6.1) from UC96US23. Three sets of sister BC3S2 near-isogenic lines (NILs) that were homozygous for the UC96US23 allele of LsNCED4 at Htg6.1 were developed by backcrossing to cv. Salinas and marker-assisted selection followed by selfing. The maximum temperature for germination of NIL seed lots with the UC96US23 allele at LsNCED4 was increased by 2–3°C when compared with sister NIL seed lots lacking the introgression. In addition, the expression of LsNCED4 was two- to threefold lower in the former NIL lines as compared to expression in the latter. Together, these data strongly implicate LsNCED4 as the candidate gene responsible for the Htg6.1 phenotype and indicate that decreased ABA biosynthesis at high imbibition temperatures is a major factor responsible for the increased germination thermotolerance of UC96US23 seeds
    corecore